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Abstract 

A common feature of recent unification- 
based grammar formalisms is that they 
give the user the abihty to define his own 
structures. However, this possibihty is 
mostly limited and does not include non- 
monotonic operations. In this paper we 
show how nonmonotonic operations can 
also be user-defin ed by applying default 
logic ( Reiter, 198C| ) and generalizing previ- 
ous results on nonmonotonic sorts (Young 
and Rounds, 199^ ). 



1 Background 

Most of the more recent unification-based for- 



malisms, such as TFS ( 


Emcle and Zajac, 199C), UD 


([Johnson and Rosncr. 1989), CUF (Dorre and Eisele, 


1991), and FLUF ( 


Stromback, 1994), provide some 



possibility for the user to define constructions of his 
own. This possibility can be more or less powerful 
in different formalisms. There are, however, sev- 
eral constructions proposed as desirable extensions 
to unification grammars that cannot be defined in 
a general and well-defined way in these formalisms. 
One such class of constructions is those that have 
some degree of nonmonotonic behaviour. Examples 
of such constructions are any-values, default- values, 
and some constructions (e.g. constraining eq uations 



completeness and c oherence) used in LFG ( Kaplan 



and Bresnan, 1983|) . 



This paper describes a method that permits the 
user to define such nonmonotonic constructions. 
This is done through generalizing the work on non- 
monotonic sorts (Young and Rounds, 1993). This 



generalization results in a default logic similar to 
( Reiter, 1980| ), but where subsumption and unifica- 



tion are used instead of logical truth and consistency. 

There are three main advantages to Young and 
Rounds' work compared with other approaches to 



default unification (Bouma, 199C; Bouma, 1992| ; 
Russel et al., 1992| ) which justify choosing it as a 
starting point for this work. The first is the sep- 
aration of definite and default information, where 
Young and Rounds are more distinct than the other. 
The second is that the nonmonotonic unification op- 
eration used is order independent. This is achieved 
by separating the unification operation from com- 
puting the nonmonotonic extension, which Young 
and Rounds call explanation. This suggests that all 
the user needs to define when generalizing the ap- 
proach is how a sort is explained. Finally, there is a 
close relationship to Reiter's (1980) default logic. 

This paper starts by providing the minimal prop- 
erties required of a unification-based formalism when 
extending with nonmonotonic definitions. I then de- 
scribe the approach of user-defined nonmonotonicity 
and illustrate how some commonly used nonmono- 
tonic constructions can be defined within it. Finally 

1 conclude with a discussion of the relation to Re- 
iter's default logic and computational properties of 
the approach. 

2 Preliminaries 

There are two main properties that will be assumed 
of a unification-based formalism in order to extend 
it with the possibility of defining nonmonotonic con- 
structions. The first, and most important, is that 
we require a subsumption order on the set S of ob- 
jects denoted by the formalism. Secondly it should 
be possible to define inheritance hierarchies on the 
linguistic knowledge described by the formalism. 

One very plausible subsumption order that can be 
used is the ordinary subsumption lattice of feature 
structures. It is, however, possible to use some other 
kind of subsumption order if that is more suitable for 
the domain to be modelled by the formalism. Exam- 
ples of other subsumption orders that might be use- 
ful are typed feature structures, feature structures 
extended with disjunction, or simply an order based 



on sets and set inclusion. 

In this paper the notation a C 6 is used when- 
ever a subsumes b (i.e. whenever a "is more specific 
than" or "contains more information than" h). Con- 
sequently, a C 6 is used whenever a C 6 but a^h. 

The subsumption order is assumed to be a semi- 
lattice and permits computing a unifier, denoted 
a n 6, corresponding to the greatest lower bound, 
for every pair of elements within it. The element 
corresponding to the bottom of the order relation is 
denoted jail and represents inconsistent information 
or unification failure. 

The second constraint placed on the formalism, 
the possibility of defining an inheritance hierarchy, 
is not essential for the definition of nonmonotonic 
operations. It is, however, very useful when defin- 
ing nonmonotonic constructions. The following no- 
tation will be used for describing an inheritance hi- 
erarchy. 

class the name o] the class; 
isa its parent in the hierarchy; 
requires a structure. 

Thus, each member in the inheritance hierarchy is 
called a class, which is defined by giving it a name 
and a parent in the hierarchy. It is also possible to 
define some constraints, called requirements, which 
must hold for a class. These requirements can be 
both structures in the subsumption order and non- 
monotonic rules. The constraints on classes arc in- 
herited through the hierarchy. Every object in a 
class is assumed to contain at least the information 
given by the constraints specified for it and all its an- 
cestors. For simplicity multiple inheritance between 
classes will not be allowed. This means that two 
classes where none of them is a subclass of the other, 
will always be considered inconsistent and thus yield 
a failure when unified. 

3 User-Defined Nonmonotonicity 

I will now describe how the work by Young and 
Rounds can be generalized to allow the user to de- 
fine nonmonotonic constructions. The main idea in 
their approach is that each node in a feature struc- 
ture consists of a nonmonotonic sort. Such sorts can 
contain two different kinds of information, the ordi- 
nary monotonic information and a set of defaults. If 
we assume that f3 is defined as a default in Young 
and Rounds' work then it is interpreted according to 
the rule: if it is consistent to believe /? then believe 
p. In Reiter's default logic this is expressed with the 
following normal default rule. 

i£ 

/3 



In this paper I want to allow the user to use other 
forms of nonmonotonic inferences and not only the 
normal default rule given above. Therefore, I will 
consider the general form of default rules. An in- 
tuitive reading of a general default rule is, if a is 
believed and it is consistent to believe /3 then be- 
lieve 7. In default logic this is usually expressed as 

a : /3 
7 

The next question is how such defined nonmono- 
tonic rules are going to be interpreted in a unifica- 
tion framework. In ( Reiter, IQSOD , a rule like the 



one above could be applied whenever a is true and 
P is consistent with the information we already have. 
If we assume that V represents the information al- 
ready given this means that the default rule can be 
applied whenever V C a and V n /3 does not yield 
unification failure. When the rule is applied the new 
information obtained would be V n 7. 

In the approach described in this paper, the user is 
allowed to define the actual nonmonotonic rule that 
should be used for a particular operation by using 
the following syntax. 



nonmon 

nameiparameteri , 



. parameter n) '■ when 



In the syntax given above name assigns a name 
to the defined rule, and thus allows the user to 
use nonmonotonic information when defining lin- 
guistic knowledge. The parameters in the rule defi- 
nition are variables, which can be used within the 
actual default rule at the end of the description. 
The user is assumed to assign the nonmonotonic 
information contained in this rule to his linguis- 
tic knowledge by using an expression of the form 
name{parameteri, . . . parametern) . 

The when slot in the rule allows the user to decide 
when the rule is going to be applied, or in Young and 
Rounds' terminology, explained. I will make use of 
two values for the when-slot, immediate and poste- 
rior. Immediate means that the nonmonotonic rule 
is going to be applied each time a full unification task 
has been solved or whenever all information about 
an object in the defined inheritance hierarchy has 
been retrieved. Posterior explanation means that 
the explanation of the rule is postponed until reach- 
ing the result of some external process, for example, 
a parser or generator. There is however no hinder in 
excluding the use of other values here. One could, 
for example, imagine cases where one would want 
different nonmonotonic rules to be explained after 



a completed parse, a generation, or after resolving 
discourse referents. 

Note that although the when slot in the defini- 
tion of a nonmonotonic rule allows the user to define 
when his rule is going to be applied we will still have 
an order independent nonmonotonic unification op- 
erator. This is the case because we follow Young 
and Rounds' approach and separate the unification 
operation from the explanation of a nonmonotonic 
rule. Therefore, what affects the final result of a 
computation is when one chooses to explain default 
rules and not the order of the unification operations 
occurring between such explanations. 

4 Formal Definitions 

In this section I provide give the formal definitions 

for nonmonotonic sorts and how nonmonotonic sorts 
are unified and explained. The definitions are gen- 
eralizations of the definitions in Young and Rounds 
(1993). The notation a ^ & is used to denote the 
fact that a n 6 does not yield unification failure. 

A nonmonotonic sort is a structure containing 
both information from the basic subsumption order 
and information about default rules to be explained 
at a later point in the computation. 

Definition 1 A nonmonotonic sort is a pair (s. A) 

where s £ 5 and A is a set of nonmonotonic 
rules of the form {w, a : /3 7) where w is an 
atom and a, jS and 7 e «S. It is assumed that for 
each nonmonotonic rule 7 C /?, a ~ s, /? ^ s, 

and 7 n s C s. 

As seen by the definition a nonmonotonic sort 
is considered to be a pair of monotonic informa- 
tion from the subsumption order and nonmonotonic 
information represented as a set of nonmonotonic 
rules. The user can assign nonmonotonic informa- 
tion to a nonmonotonic sort by calling a nonmono- 
tonic definition as defined in the previous section. 
The actual nonmonotonic rule occurring within the 
sort is a pair consisting of the when slot and the last 
part of the nonmonotonic definition, with the pa- 
rameter variables instantiated according to the call 
made by the user. 

The second part of this definition contains some 
well-foundedness conditions for a nonmonotonic 
sort. The first condition (7 C 0) is a restriction 
similar to the restriction to normal default rules in 
Reiter's (1980) default logic. This restriction ensures 
that the application of one default rule will never 
cause previously applied default rules to be inappli- 
cable. This makes the procedure for application of 
defaults more efficient and will be further discussed 
in section 6. 



The next two conditions in the definition, a ^ s 
and /3 ~ s, guarantee that the default rule is or can 
be applicable to the nonmonotonic sort. The reason 
for only checking that a ^ s instead of s C a is that 
future unifications can restrict the value of s into 
something more specific than a and thus may make 
the default rule applicable. 

The last condition on a nonmonotonic sort, 7ns C 
s, may seem superfluous. The reason for including 
it is to ensure that applying the default actually re- 
stricts the value of the sort. Otherwise the default 
rule would have no effect and can be removed. Note 
in particular that the above conditions on a non- 
monotonic sort implies that 7 may be fail. 

Given the unification operation of objects within 
the subsumption order and the definition of non- 
monotonic sorts it is possible to define an operation 
for nonmonotonic unification. 

Definition 2 The nonmonotonic unification (Hat) 
of two nonmonotonic sorts (si, Ai) and {s2, A2) 
is the sort {s, A) where 

• s = si n S2 and 

• A ^ {d \ d ^ {w,a : (3 ^ -f), d £ Ai U 
A2, a ~ s, f3 ^ s, and 7 □ s C s} 

The nonmonotonic unification is computed by 
computing the unification of the monotonic parts of 
the two sorts and then taking the union of their non- 
monotonic parts. The extra conditions used when 
forming the union of the nonmonotonic parts of the 
sorts are the same as in the definition of a nonmono- 
tonic sort and their purpose is to remove nonmono- 
tonic rules that are no longer applicable, or would 
have no effect when applied to the sort resulting from 
the unification. 

It is important to note that this generalization of 
the original definition of nonmonotonic imification 
from Young and Rounds (1993) preserves the prop- 
erty of order independence for default unification. 

When using nonmonotonic sorts containing non- 
monotonic rules, we also need to know how to 
merge the monotonic and nonmonotonic informa- 
tion within the sort. I will use the terminology w- 
application for applying one nonmonotonic rule to 
the sort and w- explanation when applying all possi- 
ble rules. 

Definition 3 The nonmonotonic rule 

{w, a : /3 7) is w-applicable to s G «S if: 

• s C a 

• s ~ /? or s = fail 

• sr\j\Zsovs= fail 



The result of the w- application is 7 □ s 

Note that the w in w-application should be consid- 
ered as a variable. This means that only nonmono- 
tonic rules whose first component is w are considered 



as above. I will start with defining default values. 
This is done by defining a nonmonotonic rule default 
for the class value, which is assumed to be the most 
general class in a defined hierarchy. The rule defined 



and that ii is possible 10 choose which nonmonotonic 
rules should be applied in a particular point at some 
computation. 

In addition note that the restriction that 7 !^ /9 
in all nonmonotonic rules and the special cases for 
s = fail ensures that the application of one non- 
monotonic rule never destroys the applicability of a 
previously applied rule. This reduces the amount 
of work required when computing a w-explanation. 
Based on these observations, a sufficient condition 
for w-explanation is defined as follows. 

Definition 4 i is a w-explanation of a nonmono- 
tonic sort (s, A) if it can be computed in the 
following way: 

1. If s = fail or no d G A is w-applicable then 
t — s else 

2. Choose a. d — {w, a : /3 =^ 7} G A such that 
d is w-applicable to s. 

3. Let s = s n 7 and go to 1. 

As shown by the definition, a w-explanation is 
computed by choosing one w-applicable default rule 
at a time and then applying it. Since the defini- 
tion of w-applicability and the condition that 7^/3 
in all nonmonotonic rules ensures that whenever a 
nonmonotonic rule is applied it can never be inappli- 
cable, there is no need to check if the preconditions 
of earlier applied nonmonotonic rules still hold. 

Note also that the choice of which nonmonotonic 
rule to apply in each step of a w-explanation is non- 
deterministic. Consequently, it is possible to have 
conflicting defaults and multiple w-explanations for 
a nonmonotonic sort. 

Note also that the result of a w-explanation is al- 
lowed to be fail. Another option would be to inter- 
pret fail as if the application of the nonmonotonic 
rule should not be allowed. However, as seen in the 
next section, for many uses of nonmonotonic exten- 
sions within unification-based formalisms, the aim is 
to derive failure if the resulting structure does not 
fulfill some particular conditions. This makes it im- 
portant to allow fail to be the result of applying a 
nonmonotonic rule. 

5 Examples 

In this section I will show how some of the most 
common nonmonotonic extensions to unification- 
based grammar can be expressed by defining rules 



here is treated as the one in (Young and Rounds 



1993) 



class value; 

nonmon default (X) : immediate :X => 
X. 

This default rule can be used when defining verbs. 
The rule is used for stating that verbs are active by 
default. I also define the two Swedish verbs skickade 
{sent) and skickades {was sent) to illustrate how this 
rule works. 

class verb; 
isa value; 

requires [form: default (active)] . 

class skickade; 
isa verb; 

requires [lex: skicka] . 

class skickades; 
isa verb; 

requires [lex: skicka, form: 
passive] . 

While retrieving the information for these two 
verbs we will obtain the following two feature struc- 
tures containing nonmonotonic sorts: 

For skickade: 

[lex: skicka, form: 

([], {(immediate , :active => 

active )})] 

For skickades: 

[lex: skicka, form: 

(passive, {(immediate, : active 

active )})] 

Since I have used immediate for the when-slot in 
the definition of the default rule, this nonmonotonic 
rule will be applied immediately after retrieving all 
information about a verb in the hierarchy. For the 
two structures above, the default rule can be ap- 
plied for skickade, since active is consistent with [], 
but not for skickades, since active and passive are 
inconsistent. The result after applying immediate- 
explanation to the two structures above is shown 
below. 



For skickade: 

[lex: skicka, form: active] 



class any_no_value ; 
isa aiiy_value . 



For skickades: 

[lex: skicka, form: passive] 

Another nonmonotonic operation that has been 
used in LFG ( Kaplan and Bresnan, 1983| ) is the 



value constraint (=c) used to check whether a sub- 
structure has a particular value after a completed 
parse. The definition of value constraints as a non- 
monotonic rule makes use of negation, interpreted 
as negation as failure. 

class value; 

nonmon =c(X) :posterior : -iX => fail. 

One use of value constraints in LFG is to assert 
a condition that some grammar rules can only be 
used for passive sentences. I will here assume that 
a representation for verbs where passive verbs have 
the value passive for the attribute form, but where 
other verbs have no value for this attribute. In the 
syntax used in this paper the constraint that a par- 
ticular grammar rule can only be used for passive 
verbs would be expressed as below: 

[form: =c(passive)] 

This would result in the nonmonotonic sort: 



[form: ([], {(posterior , 
^ fail )})] 



-ipassive 



As seen by the definition of =c, the explanation 
for this nonmonotonic sort is postponed and is as- 
sumed to be computed after finding a parse for some 
sentence. This implies that the only case where this 
rule would not apply, and thus not give fail as a re- 
sult, is when the value of form actually is passive. 
For all other values of form, we would have some- 
thing that is consistent with -ipassive and thus the 
nonmonotonic rule will derive failure when applied. 

The next nonmonotonic structure I want to dis- 
cuss is any-values. The inheritance hierarchy is used 
to be able to define any-values in a simple way. 

class value. 

class none; 
isa value. 

class any_value; 
isa value. 

nonmon anyO : posterior 
: any_rio_value => fail. 



In this small hierarchy it is assumed that all pos- 
sible values of a structure is a subtype of value. We 
then divide this into none, which represents that a 
structure cannot have any value and any_value which 
contains all actual values. The class anyjualue is 
then further divided into a class called any_no_value, 
which only contains this single value, and the ac- 
tual values of a structure. The class any_no_value 
should not be used when defining linguistic knowl- 
edge. However, when applying the default rule a 
value that has not been instantiated is compatible 
with this any_no_value. Therefore the default rule 
can make the conclusion that the structure is in- 
consistent, which is what we desire. Note that, as 
soon as a value has been further instantiated into 
a 'real' value, it will no longer be consistent with 
any_no-value, and the nonmonotonic rule cannot ap- 
ply. Two examples will further illustrate this. 

The nonmonotonic sort: 

([], {( posterior, : any_no_value =^ 

, fail )}) 
will be posterior-explained to: 
fail 

While the sort: 

([lex: kalle] , {( posterior, 

: Einy_no_value fail )}) 
will be posterior-explained to: 
[lex: kalle] 

The last nonmonotonic operations I want to dis- 
cuss are completeness and coherence as used in LFG. 
To be able to define these operations I assume the 
inheritance hierarchy above, without the nonmono- 
tonic definition of any. I will, instead, make use of 
the two nonmonotonic definitions below. 

class value; 

nonmon coherence (A) : immediate :[A: 

none] => [A: none]; 
nonmon completeness (A) : posterior 

:[A: any_no_value] => fail. 

The first of these rules is used to check coher- 
ence, and the effect is to add the value none to each 
attribute that has been defined to be relevant for 
coherence check, but has not been assigned a value 
in the lexicon. The second rule is used for check- 
ing completeness and it works similarly to the any- 
definition above. 

Finally, I will show how a fragment of a lexicon can 
be defined according to these rules. Note that in the 



definition of the transitive verb, the value any_value 
is given to the appropriate attributes. This means 
that they are inconsistent with none, and thus, the 
coherence rule cannot be applied. 

concept verb; 

isa any_value; 

requires coherence (subj ) A 

coherence (obj) A 
requires completeness (subj ) A 

completeness (obj ) A .... 

concept treinsitiveverb; 
isa verb; 

requires [subj: any_value, obj: 
ciny_value] . 

6 Relation to Default Logic 

In this section I will discuss the relation of this work 

to Reiter's (1980) default logic. There will also be 
some discussion on the computational properties and 
limitations of the given approach. 

Compared with Reiter's default logic, our notion 
of nonmonotonic sorts corresponds to default theo- 
ries. Unification of nonmonotonic sorts would then 
correspond to merging two default theories into one 
single theory and our notion of explaining a non- 
monotonic sort corresponds to computing the exten- 
sion of a default theory in default logic. 

In default logic there is often a restriction to 
normal- default theories since non-normal default 
theories arc not even semi-decidable. The restric- 
tion in our nonmonotonic rules that 7 C /3 is similar 
to the restriction into normal default rules and cap- 
tures the important property, that the application of 
one nonmonotonic rule should not affect the appli- 
cability of previously applied rules. The decidability 
of the nonmonotonic rules defined here is, however, 
highly dependant on the given subsumption order. 
In particular it is dependent on having a decidable 
unification operation and subsumption check. 

As mentioned previously there is with nonmono- 
tonic sorts, as well as normal default logic, a possi- 
bility of conflicting defaults and thus multiple non- 
monotonic extensions for a structure. One difference 
is that nonmonotonic sorts allow that the applica- 
tion of a nonmonotonic rule leads to fail, i.e. an 
inconsistent structure, while default logic docs not 
allow this outcome. However, since fail is allowed as 
a valid explanation for a nonmonotonic sort, there 
is, as for normal default logic, always at least one 
explanation for a sort. 

The two following examples will illustrate the dif- 
ference between nonmonotonic rules giving multiple 



extensions and nonmonotonic rules giving a single 
explanation fail. 

Example a 

:[c:2] 

[a:l 6:1] [6:2 c:2] 

Example b 

[a:l 6:1] [a:2 6:2] 

In example a the application of one rule, does not 
make the other inapplicable. Thus the only expla- 
nation for a structure is achieved by applying both 
these two rules and results in fail. In example 6, 
however, the application of one of the rules would 
block the application of the other. Thus, in this 
case there are two explanations for the structure de- 
pendant on which of the rules that has been applied 
first. Note that even though there is an order de- 
pendency on the application order of nonmonotonic 
rules this does not affect the order independency 
on nonmonotonic unification between application of 
nonmonotonic rules. 

Allowing multiple extensions gives a higher com- 
putational complexity than allowing only theories 
with one extension. Since it is the user who defines 
the actual nonmonotonic theory multiple extensions 
must be allowed and it must be considered a task for 
the user to define his theory in the way he prefers. 

7 Improvements of the Approach 

I will start with two observations regarding the defi- 
nitions given in section 3. First, it is possible to gen- 
eralize these definitions to allow the first component 
of a nonmonotonic sort to contain substructures that 
are also nonmonotonic sorts. With the generalized 
versions of the definitions explanations that simulta- 
neously explain all substructures of a nonmonotonic 
sort will be considered. Note that the explanation of 
default rules at one substructure might affect the ex- 
planation of rules at other substructures. Therefore 
the order on which nonmonotonic rules at different 
substructures are applied is important and all pos- 
sible application ordcTS must hQ considca'cd. 

Considering unification of nonmonotonic sorts it 
is not necessary to simplify the nonmonotonic part 
of the resulting sort. A = Ai U A2 can be defined as 
an alternative to the given definition. This alternate 
definition is useful for applications where the simpli- 
fication of nonmonotonic sorts by each unification is 
expected to be more expensive than the extra work 



needed to explain a sort whose nonmonotonic part 
is not simplified. 

As stated previously, nonmonotonic sorts allow 
multiple explanations of a nonmonotonic sort. If de- 
sired, it would be fairly easy to add priorities to the 
nonmonotonic rules, and thus induce a preference 
order on explanations. 

One further example will illustrate that it is also 
possible to define negation as failure with nonmono- 
tonic rules. An intuitive interpretation of the defined 
rule below is that if X is believed (V E X), failure 
should be derived. 

nonmon not (X) : immediate X => fail; 

However, if this definition is to be really useful 
we must also allow one definition of a nonmonotonic 
rule to make use of other nonmonotonic rules. In our 
original definition we said that the nonmonotonic 
rule above should be applied if V ~ -^X. This can 
be generalized to the case where -^X is a nonmono- 
tonic rule if we extend the definition of ~ to also 
mean that the application (or explanation) of the 
not rule at this node does not yield failure. However, 
this generalization is outside default logic. There- 
fore, its computational properties are unclear and 
needs more investigation. 

8 Conclusion 

In this paper I have proposed a method allow- 
ing the user to define nonmonotonic operations in 
a unification-based grammar formalism. This was 
done by generalizing the work on nonmonotonic 



sorts (Young and Rounds, 1993) to allow not only 
normal defaults rules but general default rules that 
are defined by the user. The method has a very 
close relation to Reiter (1980). We also noted that 
the method can be applied to all domains of struc- 
tures where we have a defined subsumption order 
and unification operation. 

The generality of the approach was demonstrated 
by defining some of the most commonly used non- 
monotonic operations. We also gave formal defini- 
tions for the approach and provided a discussion on 
its computational properties. 
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